18 research outputs found

    Responsive Action-Based Video Synthesis

    We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered interactively. Like audio, video is easy to sample from the real world but hard to segment into clean, reusable elements. Reusing a video clip means non-linear editing and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-end approach enables previewing and easy iteration. We convert static-camera videos into loopable sequences and synthesize them in response to simple end-user requests. This is hard because (a) users want essentially semantic-level control over the synthesized video content, and (b) automatic loop-finding is brittle and leaves users little opportunity to work through problems. We propose a human-in-the-loop system where adding effort gives the user progressively more creative control. Artists help us evaluate how our trigger interfaces can be used for authoring videos and video performances.
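
    The paper's full pipeline is interactive, but the brittle automatic loop-finding it builds on can be sketched as a search for a pair of visually similar frames far enough apart to loop between. The sketch below is a minimal illustration under assumed inputs (a decoded frames array, a plain L2 frame distance, a min_len threshold), not the authors' actual method.

        import numpy as np

        def find_loop(frames, min_len=30):
            """Return (start, end) such that frames[end] resembles frames[start],
            so playing frames[start:end] and jumping back loops seamlessly-ish.
            frames: (T, H, W, C) array decoded from a static-camera video."""
            T = len(frames)
            f = frames.reshape(T, -1).astype(np.float32)
            best, best_cost = None, np.inf
            for i in range(T - min_len):
                # L2 distance from frame i to every frame at least min_len ahead.
                d = np.linalg.norm(f[i + min_len:] - f[i], axis=1)
                j = int(np.argmin(d))
                if d[j] < best_cost:
                    best, best_cost = (i, i + min_len + j), d[j]
            return best

    Real footage rarely contains an exact repeat, which is why such a search is brittle on its own and why the paper puts a human in the loop to steer and repair it.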

    Joint Semantic Segmentation and 3D Reconstruction from Monocular Video

    © Springer International Publishing Switzerland 2014. The original publication is available at www.springerlink.com. DOI: 10.1007/978-3-319-10599-4_45.
    We present an approach for joint inference of 3D scene structure and semantic labeling for monocular video. Starting from a monocular image stream, our framework produces a 3D volumetric semantic + occupancy map, which is much more useful than the series of 2D semantic label images or the sparse point cloud produced by traditional semantic segmentation and Structure from Motion (SfM) pipelines, respectively. We derive a Conditional Random Field (CRF) model defined in 3D space that jointly infers the semantic category and occupancy of each voxel. Such joint inference in the 3D CRF paves the way for more informed priors and constraints, which would not be possible if the two problems were solved separately in their traditional frameworks. We make use of class-specific semantic cues that constrain the 3D structure in areas where multiview constraints are weak. Our model comprises higher-order factors, which help when depth is unobservable, and we use class-specific semantic cues to reduce the degree of such higher-order factors or to approximate them with unaries where possible. We demonstrate improved 3D structure and temporally consistent semantic segmentation for a difficult, large-scale, forward-moving monocular image sequence.
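
    As context for readers, a voxel-level CRF energy of the kind described typically takes the following generic form; the notation here is assumed for illustration and not taken verbatim from the paper:

        E(\mathbf{x}) \;=\; \sum_{v \in \mathcal{V}} \phi_v(x_v)
          \;+\; \sum_{(u,v) \in \mathcal{E}} \psi_{uv}(x_u, x_v)
          \;+\; \sum_{c \in \mathcal{C}} \phi_c(\mathbf{x}_c)

    Each voxel variable x_v ranges over {free} plus the semantic classes, so occupancy and category are inferred jointly: the unaries phi_v can fuse SfM ray evidence with projected 2D classifier scores, the pairwise terms psi_uv encourage consistent transitions between neighboring voxels, and the higher-order factors phi_c couple whole cliques of voxels (for example, the voxels along a camera ray) where depth is unobservable. Inference seeks the labeling that minimizes E(x).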

    Co-inference for Multi-modal Scene Analysis

    Acquisition of Articulated Human Body Models Using Multiple Cameras

    Semantic Classification in Aerial Imagery by Integrating Appearance and Height Information

    In this paper we present an efficient technique for accurate semantic classification at the pixel level that is capable of integrating various modalities, such as color, edge responses, and height information. We propose a novel feature representation based on Sigma Points computations that enables a simple application of powerful covariance descriptors within a multi-class randomized-forest framework. Additionally, we include semantic contextual knowledge using a conditional random field formulation. In order to achieve a fair comparison to state-of-the-art methods, our approach is first evaluated on the MSRC image collection and then demonstrated on three challenging aerial image datasets: Dallas, Graz, and San Francisco. We obtain a full semantic classification of a single aerial image within two minutes. Moreover, we investigate the computation time on large-scale imagery comprising hundreds of images.
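
    As a rough illustration of the Sigma Points idea, the sketch below turns a region's mean and covariance of per-pixel features into a fixed-length Euclidean vector (the mean plus scaled Cholesky columns) that a randomized forest can consume directly. The feature set, the regularization, and the sqrt(d) scaling are assumptions made for this sketch, not details confirmed by the abstract.

        import numpy as np

        def sigma_point_descriptor(feats):
            """feats: (N, d) per-pixel features (e.g. color, edge responses,
            height) gathered over one image region.
            Returns a (2d + 1) * d vector for a multi-class random forest."""
            mu = feats.mean(axis=0)
            d = feats.shape[1]
            cov = np.cov(feats, rowvar=False) + 1e-6 * np.eye(d)  # keep SPD
            L = np.linalg.cholesky(d * cov)  # columns act as scaled "square roots"
            pts = [mu] + [mu + L[:, i] for i in range(d)] \
                       + [mu - L[:, i] for i in range(d)]
            return np.concatenate(pts)

    Concatenating the sigma points sidesteps the non-Euclidean geometry of covariance matrices, which is what makes the descriptor compatible with an off-the-shelf randomized-forest framework.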

    Beyond the Line of Sight: Labeling the Underlying Surfaces

    Sparse dictionaries for semantic segmentation

    A popular trend in semantic segmentation is to use top-down object information to improve bottom-up segmentation. For instance, the classification scores of the Bag of Features (BoF) model for image classification have been used to build a top-down categorization cost in a Conditional Random Field (CRF) model for semantic segmentation. Recent work shows that discriminative sparse dictionary learning (DSDL) can improve upon the unsupervised K-means dictionary learning used in the BoF model, thanks to the ability of DSDL to capture discriminative features from different classes. However, to the best of our knowledge, DSDL has not been used to build a top-down categorization cost for semantic segmentation. In this paper, we propose a CRF model that incorporates a DSDL-based top-down cost for semantic segmentation. We show that the new CRF energy can be minimized using existing efficient discrete optimization techniques. Moreover, we propose a new method for jointly learning the CRF parameters, the object classifiers, and the visual dictionary. Our experiments demonstrate that by jointly learning these parameters, the feature representation becomes more discriminative and segmentation performance improves over state-of-the-art methods that use unsupervised K-means dictionary learning.
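
    To make the top-down cost concrete, here is a minimal sketch of how sparse codes from a given dictionary could be turned into CRF unary costs. The shapes, the OMP coder, and the linear classifiers W are assumptions for illustration; the paper's joint learning of the dictionary, classifiers, and CRF parameters is not shown.

        import numpy as np
        from sklearn.decomposition import SparseCoder

        def topdown_cost(X, D, W, k=5):
            """X: (n_regions, n_features) bottom-up region descriptors.
            D: (n_atoms, n_features) learned dictionary, rows unit-norm.
            W: (n_classes, n_atoms) linear classifiers over sparse codes.
            Returns (n_regions, n_classes) costs for the CRF unary term."""
            coder = SparseCoder(dictionary=D, transform_algorithm='omp',
                                transform_n_nonzero_coefs=k)
            A = coder.transform(X)   # sparse codes, one row per region
            return -(A @ W.T)        # higher class score -> lower cost

    A discriminatively learned D makes the codes A separate the classes better than K-means atoms would, which is the advantage the CRF's top-down term inherits.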